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TECHNICAL MEMORANDUM X-53983 


FAST CARRY ACCUMULATOR DESIGN 
SUMMARY 


Methods for increasing the speed of binary addition by decreasing carry 
propagation time are reviewed. Particular attention is given to those methods 
that would be applicable to an accumulator using ones complements with end- 
around carry for subtraction. An iterative accumulator using pulse logic for 
input and carry signals is described. Realizations of gated carry , carry- 
completion detection, and carry-skip circuits that would be compatible with this 
accumulator are presented. NAND gates are used in the design of the required 
combinational networks. 


INTRODUCTION 

Digital techniques are finding application in closed-loop control systems. 
Sampling permits the control of large amounts of power by sensitive control 
elements and decreases the loading on sensing devices. The use of sampled 
data and digital components in control systems facilitates time sharing of 
portions of the system, which results in both economy and reliability in design. 
A digital compensator in a control loop is shown in Figure 1, where D(z) 
represents the transfer function of the compensator, G (s) is the transfer 

d. 

function of the system sensing element, and (s) is the transfer function of 
the system driving element. 


R(s) 



Figure 1. Closed-loop system with sampling [1]. 

A model of a digital compensator is shown in Figure 2 [ 1] , A second- 
order compensator transfer function may be described by [2] 






Sampler Hold 


Figure 2. Digital compensator [ 1 ] . 


e 2 (KT) =e 1 (KT) +a 1 e 1 (KT-T) + a 2 ei(KT-2T) - bje^KT-T) 


-b 2 e 2 (KT-2T) 


where T is the sampling period. 

The accuracy with which this equation actually describes the transfer 
function may be seen to depend upon the speed with which the computer performs 
the indicated arithmetic operations. If r is the time required to complete the 
computation, the output of the compensator is a function of KT + t. This r 
corresponds to a lag in a continuous system and, if large enough, could con- 
tribute to stability problems. 

Adder circuitry is essential not only for addition, but also for multipli- 
cation, subtraction, and division. The speed of the computational process is 
then heavily dependent upon that of the adder. In general, the basic design 
problem for arithmetic units is that of achieving high speed and accuracy at 
low cost. Size, weight, and sensitivity to environment are not generally of 
great importance in a general purpose computer. These factors, along with a 
greater emphasis on reliability, do become of great concern in the special 
purpose computers used in digital compensators. 

The most economical adder is probably the serial machine represented 
in Figure 3. Only one full binary adder (FBA) unit is required to sum the 
addend and augend words. A typical FBA stage is shown in Figure 4. However, 
only one element of each word is added in one computation time so the time for 
the addition of the two words (of equal length) is then nT where n is the 
number of elements in a word and T is the time required for each addition. 

The time required for the addition of the two words can be decreased 
by the use of a pseudoparallel or simple iterated adder as represented in 
Figure 5. Here the elements of the n-bit addend and augend are applied to n 
FBA’s instantaneously, and all partial sums (x^ © yj are formed during one 
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Figure 3. Simple series accumulator. 

addition time T. The addition is not completed, however, until all carries 
generated have been added to the partial sums in the succeeding stages. This 
carry propagation is essentially a serial process, and in the worst case a carry 
generated in the first stage would have to propagate through the entire length of 
the adder. 

It is thus recognized that the problem of increasing the speed of the 
addition process reduces, to a great extent, to that of decreasing carry prop- 
agation time. A considerable amount of work has been done in this area and 
several solutions have been proposed. 

REVIEW OF METHODS FOR DECREASING 
CARRY PROPAGATION TIME 

Several methods for increasing the speed of binary addition by decreasing 
carry propagation time have been presented by Sklansky [ 3] , MacSorely [ 4] , 
and Lehman and Burla [5]. Those methods and others that might be applicable 
to accumulators using ones complements with end-around carry for subtraction 
will be reviewed here. 
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Figure 5, Simple iterated (pseudoparallel) adder. 

The truth table for carry determination for two-summand binary addition 
is given in Table 1. 


TABLE 1. TRUTH TABLE FOR CARRY DETERMINATION 


c. 

1 

X. 

1 

y i 

C i+1 


0 

0 

1 

0 

Carry out determined solely by carry in. 

0 

1 

0 

0 


mm 

0 

i 

1 


Hi 

1 


1 


0 

0 

0 

0 

Carry out determined solely by the 

1 

0 



summands, x. , y. . 

0 

i 

1 

1 

i i 

1 

i 

1 

1 



Gated Carry 

Perhaps the simplest method for decreasing carry propagation time is 
that of gated carry. Table 1 [6] shows that the dependency of the carry out of 
a stage will fall into two categories; one in which the carry out is determined 
entirely by the carry in and one in which the carry out is determined only by 
the summands. Unlike summand bits in a stage may be detected and used to 
provide a path for carries from its preceding stage directly to its succeeding 
stage. In a stage with like summands, the above path is interrupted, and a 
one carry or zero carry is provided for its succeeding stage, depending upon 
whether the summands are both ones or both zeros. One such scheme using 
a 2-bit time addition is shown in Figure 6 [7]. A variation of this method 
using storage elements for the carry produced in each stage is called the stored- 
carry technique and is shown in Figure 7 [7]. 
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In either of these techniques, a carry entering the first stage would 
pass through two gates per stage through the n-i stage and through two gates 
th. 

in the n stage to be assimilated. The maximum carry propagation time is 
then 2(nd) , where n is the number of stages and d is the delay time per 
gate. 


Carry-Completion Detection 

In methods using fixed-time addition, time for the worst case propaga- 
tion through the entire accumulator plus a safety factor must be allowed. It 
can be shown that for the addition of numbers with a random distribution of 
ones and zeros, the average maximum carry length is less than log 2 n, where 
n is the word length [ 8] . For the propagation of both carry and no carry, the 
average length is log 2 5n/4 [9, 10]. The addition could then be speeded up if 
only the time needed for the actual carry propagation could be allowed. 
Gilchrist, Pomerene, and Wong [6] proposed a scheme to detect the presence 
of a carry and use its completion to initiate the next addition step. The princi- 
ple of carry-completion detection is illustrated in Figure 8. At the initiation 
of a carry operation, a one is inserted into the zero-carry line preceding the 
first adder stage. A one carry or zero carry is then gated or generated in 
the succeeding stages as determined by the inputs at those stages. The con- 
dition for a one carry is 


c. 1 = x.y. + c. • (x. © y.) 
i ii l-l i i 


and the condition for a zero carry is 


c.° = x.y. + c. * (x. © y.) 

l ii l-l ii 


( 1 ) 


( 2 ) 


Note that both the one- and zero-carry lines must be monitored so that a false 
completion signal will not be generated. One stage of an accumulator using 
carry-completion detection is shown in Figure 9 [7]. A method for evaluating 
the reduction in time achieved by the use of asynchronous addition techniques 
has been given by Hendrickson 1 10] , who arrived at a value of 90 percent 
savings for a 100-bit adder. 
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Figure 8. Principle of carry-completion detection. 

A NOR gate binary adder with carry-completion detection has been 
presented by Majerski and Wiweger [11]. One stage of their realization of an 
adder is shown in Figure 10. The Boolean formulas describing the circuit of 
Figure 10 are 


D. 

l 


= x.y. + x.y. 

li ii 


(3) 


D.c. 

i l 


D.c. 

i l 


x.y. + x.y. + x 

ii ii 


i 


-1* 


i-i 


x.y. + x.y. + x 
ri ii 


i-r 


i-i 


+ D 


i-i 



+ D. c. . 
i-i i-i 


(4) 

(5) 
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x - Accumulator Register Element 

y - Input Register Element 

Sum, X+Y, stored in Accumulator Register 


Figure 9. Carry-completion detection, one stage. 
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Figure 10. NOR gate adder stage [11], 


s. = D.c. + D.c. 

1 li li 


= D.c. + D.c. + x. y. 
i l ii l-l l- 


where 

x y (i = 1, 2, . . . , n) = bits of summands 
i i 

c^ (i = 1, 2, . . . , n+i) = carry from i-1 to i position 
s. (i = 1, 2, . . . , n) = i^ 1 bit of sum 



Figure 11 shows the adder carry and carry-completion detection circuits of 
Majerski and Wiweger [ 11] . An advantage of the adder carry circuit is that 
a carry must propagate through only one gate per stage, whereas two gates 
are required in other known designs. The Boolean formulas for the carry- 
completion detection circuit are 


H. = x.y. + x.y. + D.c.h + D.c.h 
1 li li ii ii 


i = 2, 4, 6, . . . , 2E (n/2) 


H 


H + H + H + 
2 4 6 


' ' + H 2E (n/2) 


(7) 


( 8 ) 


where h is the carry control signal. 

Operation begins with h = 1 and the add control signal. After at least 
2d, where d is the maximum propagation time of a NOR gate, h is switched 
to 0. The carry propagation process then begins and lasts until all D.c^ and 

D.c. gates stabilize. The h = 1 signal is applied to prevent a false carry- 

completion signal from being generated. The carry-completion signal may 
precede the real completion by up to d since only every second position is 
monitored. 


Carry-Skip Techniques 

Assuming the complements of both summands are available, an FBA 
stage may be represented as shown in Figure 12. The functions representing 
the sum and carry are then 


s i = x i © y t © c i 


(9) 


C i+l 


x.y. + 

i i 


x.c. + y.c. 

ii ii 


( 10 ) 
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( 11 ) 


( 12 ) 


ti = X i®^i = con( ^^ ori f° r transmission of a carry 


% = X i^- = con( ^ 011 ^ or generation of a carry. 


Then 

r = g. + t.c. = carry out of a stage. 

£ 1 HI 


The carry into any stage may then be expanded as follows: 


( 13 ) 

( 14 ) 

( 15 ) 
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C. = r. , 
i i-l 


ec + t • r 

*4-1 i-l 1-2 


sc + t • g + 1 » t ® r 

i-l i-l *4-2 i-l i-2 i-3 


sc „ + t « g + 1 • t * gv 

*4-1 i-l i-2 i-l i-2 *4-3 


+ Vl • ‘i-2 • ‘i-3’ r i-4 141 • 


(16) 


The expansion of r may be carried as far back as desired. The limit 
for an n-stage adder is an expression for c^ containing r^. 

Application of this principle to a section of an adder is illustrated in 
Figure 13 1 4], Allowing the i to take on successive values in equation (4) , 
and omitting all terms with negative subscripts, it is seen that each stage of the 
adder will require one i-input OR gate and i AND gates having one through i 
inputs. Thus the number of circuit elements would become prohibitive for 
adders of more than a few stages. However, the maximum carry path between 
any two stages is two levels and it is four levels for a complete addition. 

The adder can be broken up into groups of stages connected in the manner 
described. The carry into each group would be designated c^, the carry out of 

a group designated r , and the group containing the lower-order stages designated 

group 1. Assume that six stages would be a reasonable number of stages to be 
connected with full carry -look-ahead. If the six-stage groups are now connected 
in series with c^. =r^. a carry will require four levels to be generated, 

two levels to be transmitted through each intermediate group , and four levels 
to reach and produce a sum in the final group. For 6-bit groups then, the 
maximum carry path length would be 4 + (2n/6) , where n is the number of 
stages. For a 30-bit adder, this would be 14. This technique may be extended 
by providing carry -look- ahead circuitry between groups and even further by 
dividing the groups into sections and providing look-ahead between sections. A 
carry-look-ahead, or carry-skip, network for three groups is illustrated in 
Figure 14. 
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Figure 13. Three-bit adder group with carry prediction [4]. 
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Majerski [ 12] has proposed and implemented a NOR gate realization of 
a carry-skip circuit. For use of this circuit the skips may comprise only odd 
numbers of adder stages and their distribution relative to one another must 
always be an even number of stages. The adder with which this circuit is used 
is an asynchronous binary NOR gate implementation, of which the structure is 
given by the following equations : 

For odd adder stages. 


s. = D.c. + x. y. + D. ,c. , + x.y. + x.y. + D.c. , 

x 11 l-l 'i-i l-l l-l ri ri ii 


(17) 


D i°i = Yi + Vi + Yi y i-i + D i-i °i-i + h 


(18) 


where h is the carry control signal; 
for even adder stages, 


s. 

i 


D.c. + 

l l 


Wi-1 


D i-iY 


+ x.y. + x.y. + D.c. 
ri ri ii 


(19) 


D c 
ii 


x.y, + xy +x y 
11 11 1-1 1 “ 


D. c. 
1-1 1-1 


( 20 ) 


where 


D. = Yi + Yi 


( 21 ) 


Carry propagation begins with a change in h from 1 to 0 and ends with the 
stabilization of the states of all gates. 

The carry-skip circuit, comprising k stages from j + 1 to j + k of 
an adder, is described by the function: 
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= D.c. + x.y. 
* k 3 3 3 3 


+ W + 


+ X 


'j+k^j+k 


( 22 ) 


where j is an even number and k is an odd number. The carry and skip 
circuits for k * 3 are shown in Figure 15 [12]. 

Majerski f s equations for determining optimal and economical skip dis- 
tributions in adders using his design are presented in the Appendix. 


DESCRIPTION OF A SIMPLE ITERATIVE ACCUMULATOR 

The parallel binary accumulator shown in reduced block diagram form 
in Figure 16 has been used by the Auburn Research Foundation in the design of 
a digital compensator [ 2] . It was selected because of its automatic carry 
feature and relatively simple control circuitry. Its properties will be briefly 
described here and used as a basis on which to judge improvements in carry 
propagation time resulting from the application of fast carry techniques. 

The blocks labeled AC are T flip-flop storage elements of the accum- 
ulator register, and those labeled DM are delay multivibrators used for carry 
generation. The NAND gate pairs in the lower part of the figure form the 
augend-complementing structure. The input from the complementing-gate 
structure is a negative -going pulse when the input is positive. The negation 

til 

output of the DM is used. The input to the i -stage flip-flop (AC ) is then 


c. = y. + c. 

i i i 


(23) 


The output equation of the 


.th 

l 


flip-flop is 


x.' = t.x. + t.x. , (24) 

l ii ii 


where t. 

l 


is the input. 
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Figure 15 . Carry-skip circuit [ 12 ]. 
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The total addition for a stage takes place in two steps. In the first step, 



Then 


x i + i =y iV y i x i 


= y. © x. 


(25) 


The complement output of the flip-flop is the function 


y i @X i = y i 0X i * (26) 

th 

In the second step of the addition, the input of the i flip-flop is c.. The final 
output is then 

X l+2 = C i + c i % 

= c.©y i ©x. (27) 


The accumulate operation is initiated by the application of the accumulate 
pulse. The contents of each stage of the augend, or input, register are gated to 
the input of the corresponding stage of the addend, or accumulator, register. 
This operation is simultaneous for all stages within the limits of the variation in 
propagation time between NAND gates. If the augend bit is a logic one , with the 
sign bit a logic zero (positive) , the accumulator flip-flop changes state. Thus 
if a logic one from the augend stage is added to a logic one in the accumulator 
stage, the partial sum, a logic zero, is stored in the accumulator. The transi- 
tion of the accumulator stage from a one to a zero state triggers the delay 
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multivibrator and generates a carry , which is routed to the succeeding stage. 

If a logic one from the augend stage is added to a logic zero in the corresponding 
accumulator stage, the partial sum, a logic one, is stored in the accumulator. 

The transition of the accumulator stage from a zero state to a one state does not 
trigger the delay multivibrator, and no carry is generated. At this point the 
accumulator contains the results of the operation x. © y . . The carry generated 

in the preceding stage is then added to the contents of each stage of the accum- 
ulater, forming another partial sum and possibly generating other carries. The 
delay time of the multivibrator is such that all partial sums are formed before 
the carries are generated. A timing diagram for the operation of one stage of 
the accumulator is shown in Figure 17. The AC registers trigger when there is 
a change from the one to the zero level. Therefore , the trailing edge of the 
accumulate pulse is used as the zero-time reference. At the end of the accumu- 
late operation, the augend in the input register has been added to the addend in 
the accumulator register, and the sum has been stored in the accumulator 
register. 

Typical delay times of the elements used in the accumulator are given 
in Table 2 [13], The DM pulse width is set at .100 ns to assure operation of all 
circuits to which it is applied. It can be seen from the values in this table and 
by referring to Figure 16 that a maximum delay time of 120 ns is required from 
the end of the accumulate pulse to the formation of partial sums in the accumu- 
lator register. The time required to form a carry and transmit it to the suc- 
ceeding register is a 30-ns delay for the multivibrator, a 100-ns pulse width 
of the multivibrator, and a 30-ns delay for the input gate, yielding 160 ns. The 
total time for the generation and transmission of the first carry is then 280 ns. 
The time for the generation and transmission of each secondary carry (generated 
by the assimilation of a carry) would be 220 ns, since the delay time of the 
complementation gate and one input gate would then be omitted. The worst case 
carry propagation time for a 30-stage accumulator of this design would then be 
280 ns for the generation of a primary carry in the first stage, 6160 ns (28x220) 
for transmission through the intermediate stages, and 60 ns for assimilation in 
th 

the 30 stage, yielding 6. 50 ms. 

The relative cost of the circuit elements used in the accumulator is given 
in Table 3 [13] , using a 2- input NAND gate as a basis. Inverters will be con- 
sidered as 2-input NAND gates with the inputs common. A considerable variation 
in cost is found, but these figures should be realistic enough for comparative 
evaluation of designs. 

The cost figure for the portion of this accumulator dealing with carry 
propagation is then 630. 
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Figure 17. Timing diagram for carry generation and assimilation. 



TABLE 2 , LOGIC ELEMENT DELAY TIMES 


Logic Element 

Typical 
Delay Time 

Maximum 
Delay Time (ns) 

NAND gate 

25 ns 

30 

AC register; 



T input 

45 ns 

60 

DC input 

65 ns 

80 

Delay multivibrator; 



Assertion delay 


60 

Negation delay 


30 

Minimum 



Pulse width 

50 — 300 ns, 
300 ns — 3 /js 



TABLE 3. RELATIVE COST OF LOGIC ELEMENTS 


2-input NAND gate 


3- input NAND gate 2 

4- input NAND gate 4 

6-input NAND gate 6 

12-input NAND gate 12 

Flip-flop 8 


Relative Cost 


Logic Element 


Delay multivibrator 


12 
















RAPID ACCUMULATOR REALIZATIONS AND ANALYSIS 

The methods previously reviewed were investigated with regard to their 
application to the accumulator. This accumulator utilizes pulse logic for the 
input and carry signals. NAND gates were used in the design of combinational 
circuits because of their availability in integrated circuit form and their use in 
the basic accumulator. 

The techniques presented by Majerski and Wiweger [il, 12] offer faster 
propagation times than those given in the following sections when used with 
adders of their design, but were not directly applicable to this accumulator 
because of its use of pulse logic. 

Both one- and two-layer carry-skip methods were investigated. It was 
found that two-layer skips offered no appreciable speed advantage over one- 
layer skips for this accumulator. 

The results of the investigation are presented in the following sections. 


Gated Carry 

It is seen from Table 1 that the carry out of a stage depends solely upon 
the carry in when the summands are unlike. The function representing unlike 
summands is 


x. ©y. =x.y. + x.y. 
1 l il il 


The carry out of a stage depends only on the summands when they are both 
alike. The function representing like summands is 



Vi + Vi 


In the accumulator previously described, the above two functions are available 
th 

from the i -stage flip-flop of the accumulator at the true and complement 
outputs, respectively. The carry from the i^ stage is generated when the 
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complement output goes to the logic-one level. The functions necessary for 
applying the gated-carry technique presented are available. The assimilation 
of a carry in a stage for which the function x. © y is true will produce another 

carry, which poses no problem with level logic. In the accumulator, extraneous 
pulses would be inserted into the carry propagation line. The suppression of 
these extraneous pulses must be considered in applying the gated-carry 
technique. 

til 

Let r be the carry inserted into the carry propagation line by the i 
stage, and let 


c. 

x 


= r 


i-1 


( 28 ) 


Let g. ^ be the carry pulse generated by the condition x. ^y. ^ , which results 

when both summands of the i-1 stage are at the logic-one level. The function 

for c. must then be 
1 


c. 

1 


r i-2 





+ eg i-i 


( 29 ) 


where e is a carry-generation enable pulse. This pulse must be present for 
the generation of a carry in step one of the addition process, and absent to 
prevent the generation of an extraneous cariy when a carry is assimilated, 

A circuit to realize this c. function is shown in Figure 18. The equation 
for c. s written from the figure, is 


°i - r i-2 




eg 


1-1 
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gated carry. 













Applying DeMorgan f s theorem, 


c 9 

i 


= r 


i-2 


(y i-i® x i-i )+eg i-i 


9 


which is the desired result. 

The accumulation step is initiated with the accumulate pulse. If x^ 
and y. are logic ones, the AC complement output changes state from a zero to 

a one, triggering the DM and generating a carry pulse. The carry pulse enters 

the carry propagation line through the gate labeled 4 in Figure 18. If x. and 

1 

y^ are both at the logic-zero level, the complement output of the AC is at the 

logic-one level and does not change state and no carry is generated. This is 
the situation in which the carry out is determined solely by the summands. The 
true output of the AC is then at the logic-zero level and attains this level 120 ns 
before a carry is generated in the preceding stages. This level is then applied 
to the gate labeled 3 in Figure 18, blocking the transmission of any carry 

th 

generated in the preceding stages. A carry entering the i stage is routed 
through gates 5 and 6 to the AC input, causing a change in state. The assim- 
ilation of the carry then removes the transmission block, but does so 120 ns 
after the carry pulse has ceased to be present at gate 3. 

If x. and y. are different, x. ©y„ is true, and the true output of the 
l i i i 

AC takes on the logic-one level. If x. is one and y is zero, there is no AC 

state change. If y is one and x. is zero, the AC would change state from a 

complement logic-one level to a true logic-one level. In either case, no carry 
would be generated. The resulting situation is that the carry out is determined 
solely by the carry in. The true level is applied to gate 3, enabling the trans- 
mission of an incoming carry. A carry into the stage is then assimilated through 
gates 5 and 6, causing a state change of the AC. This removes the gating level 
to gate 3, 120 ns after the carry is transmitted. No new carry is generated by 
the stage because of the removal of the carry -generation enabling level, e, 
from gate 2. The enabling level, e, is generated by a DM, which is actuated 
by the accumulate pulse. The DM pulse width is set at 370 ns to gate all 
primary carries generated and to block all secondary carries generated. 
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A timing diagram for one stage of the accumulator is shown in Figure 
19, Assuming the necessary conditions, the time required to insert a carry 
into the propagation line after the end of the accumulate pulse is a 30-ns delay 
each for the complementing gate and gate 6, a 60-ns delay for the state change 
of the AC, a 60-ns delay for the DM, a 100-ns delay for the DM pulse width, 
a 30-ns delay for gate 2, and 30 ns for the insertion-gate delay, or a total of 
340 ns. There is a delay of 30 ns per gate per stage, or 60 ns per stage through 
the carry propagation line. It requires 120 ns to assimilate a carry in a stage. 
The equation for determining the total accumulation time for two binary numbers 
is 


A = (460 + 60 n>) 


(30) 


where n* is the number of stages through which a carry must be propagated 
before assimilation. The total time required to generate a carry in the first 
stage of a 30-stage accumulator, propagate it through the entire length, and 
assimilate it in the last stage is 


A = [460 4 60(28)] ns = 2. 14 jus 


The equipment that must be added to the accumulator previously described 
to facilitate carry gating consists of four 2-input NAND gates per stage and one 
DM unit to generate the carry -generation enable pulse, e. The cost figure for 
the portion of this accumulator concerned with carry propagation is then 756. 

The gated-carry design gives a speed increase of 6. 50 jus/ 2. 14 jus or 
3. 37 to 1 over the simple iterative accumulator. The cost increase is 21 percent. 


Carry- Completion Detection 

The gated-carry method produces an appreciable decrease in the maxi- 
mum carry-propagation time. To operate the accumulator on a fixed-time basis, 
maximum delays with an added safety factor must be considered, and the accum- 
ulation rate must always allow time for the maximum length of carry propagation. 
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Figure 19. Timing diagram for carry generation and assimilation (gated carry). 



I 


As pointed out previously, the average carry propagation length is 
log 2 n. This value is 4. 91 for an n of 30, so more rapid accumulation could 
be accomplished using a variable rate , with the rate controlled by the completion 
of carry propagation. Let r., i = i, 2, . . . , 30, be the carry pulse inserted 

1 th 

into the carry-propagation line by the i stage. Carry propagation is com- 
plete when a pulse is not present at any of the insertion points. The required 
condition for carry completion, D , is then 


D = rj • r 2 • r 3 r 30 (31) 

A carry-completion-detection circuit that could be used with the gated- 
carry accumulator is shown in Figure 20. From the figure, the carry-com- 
pletion signal is a positive level described by the equation 


D = d • r t * r 2 r 10 • d • r lt • r 12 r 20 


* d • r 21 • r 22 • 


r 3 o 


Applying DeMorgan's theorem gives 


D = d • r t • r 2 • • • • • r 30 


where d is the detection- enable level. This level is generated from the 
accumulate pulse by means of a DM and a T flip-flop. The delay of the DM 
is set to ensure that any carry being propagated will be present at the output 
of the gates labeled 5 before the detection system becomes operative. This 
prevents the generation of a false completion signal. The completion detection 
signal then resets the flip-flop in preparation for the next accumulate pulse, 
which will also be controlled by D. The timing diagram for this operation is 
shown in Figure 21. The overlap of carry signal at adjoining sensing points 
along the carry propagation line is shown in Figure 22 and is seen to be 40 ns. 

The timing diagram for the generation and assimilation of a carry is 
shown in Figure 23. The maximum delay times of circuit elements were used 
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Figure 21. Timing diagram for detection enable signal generation. 
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Figure 22. Continuity of carry signal at sensing gates. 

in defining this diagram. It is seen that 340 ns are required to insert a carry 
into the carry propagation line. The time required to assimilate a carry 
after it has arrived at a stage is 120 ns. The equation for accumulation time 
is 


A = (460 + 60n*) ns 


(32) 


where n 1 is the number of stages through which a carry must be propagated 
before it is assimilated. The time required to generate a carry and propagate 
it through the entire length of a 30-stage accumulator is 

A = [460 + 60(28)] ns 
= 2. 14 /tts 
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Figure 23. Timing diagram for carry generation and assimilation 
in carry-completion detection realization. 



The carry propagation complete signal is generated 210 ns after the 
last carry has ceased to be present at a sensing point, as shown in Figure 24, 
The final accumulator-register transition is completed 90 ns from that time. 
The difference, 120ns, must be added to the 2. 14 ms to yield the total maximum 
accumulation time of 2. 26 Ms. For a one-stage propagation, x. ^ would 

change state 460 ns after the end of the accumulate pulse. The total time for 
the accumulation step in this case is 460 ns + 120 ns = 580 ns. If no carries 
are generated in an accumulation, the limiting factors on speed are the gener- 
ation of the detection enable signal, d , and its propagation through the com- 
pletion detection circuit. The minimum accumulation time is 
300 ns + 210 ns = 510 ns. 

For the theoretical average propagation length of 4. 91 (or 5) stages, 
the accumulation time is 


340 ns for carry generation and insertion into carry propagation line 
+ 5 x 60 ns for propagation through 5 stages 
+ 30 ns for the delay of gate 5 
+ 210 ns for completion circuit delay 
= 880 ns . 


Compared with the accumulation time of 6. 50 ms which must always be allowed 
in the accumulator previously described, this is an increase in speed of 7. 39 
to 1 for the addition of a large number of summands with random bit distribution. 

The circuit elements that must be added to the gated-carry accumulator 
to implement the carry-completion detection feature are three 11-input NAND 
gates, one 3-input NAND gate, four 2-input NAND gates, two flip-flops, and 
one delay multivibrator. From Table 3 it can be seen that the cost figure will 
increase by 62, yielding a total of 818. This is a 29. 8-percent cost increase 
over that of the simple iterative accumulator. 
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Figure 24. Timing diagram for carry-completion signal generation. 



Carry-Skip Techniques 

The accumulator under investigation uses ones complements represen- 
tation of negative numbers with end-around carry, so a skip distribution using 
equal size groups is considered [6], The accumulator of n bits is divided 
into k groups of m bits each so that 


mk = n 


(33) 


The greatest carry propagation time results when the conditions for 
the summands are 


xi • yi 

(34) 

Xi (^) yi f i - 2 ? 3, . * • j n.— i j 

(35) 


and 


X30 ■ yso 

The time required for propagation is then 

t* = [1 + (m-i) + (k-2) + (m-i)l t. u. 
= [2m + k-3] t. u. 

= (2m + n/m-3) t. u. 


(36) 


(37) 


where t„u, is the propagation time of two NAND gates [6]. Differentiating 
equation (37) with respect to m and equating to zero yields 
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2-n/m 2 = 0 , 

m 2 = n/2 , 

m 2 = \/ n/2 , (38) 

m 2 = km/2 , (39) 

k = 2m 


for the minimum propagation time. For n = 30, the approximations m = 5 
and k = 6 are used. The group containing the lower-order bits is called 
group i. 

The maximum propagation time for a 30-bit accumulator is 


T » = [1 + (5-i) + (6-1) + (5-1)] x 60 ns 
= 780 ns 


The time required to generate a carry was 340 ns , and the time to 
assimilate a carry after it arrived at a stage was 120 ns. The maximum 
accumulation time for a 30-stage accumulator using one-layer carry-skips 
is then 


A = (460 + T 3 0 ) ns 
= 1. 24 ps 


Equations (11) through (16) will be rewritten here using notation for a 
five-stage group. Let 


r . = (x. . © y. .) c . + x. . ' y, . 
k, i k, x k? i k, x i k,x 


( 40 ) 
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be the carry signal produced by the i^ 1 stage of the group, where y 


and . are the summands of the i^ 1 stage of the group and c 
carry into that stage and 


k, i 


k, i 
is the 


c i • = r , 

k, l k,x-l 


(41) 


Let 



(42) 


be the condition for the transmission of a carry through a stage. This signal 

th 

is present at the true output of the k, i -stage accumulator register 110 ns 
after the accumulate pulse has ended (Fig. 23). 


Let 


S k, i Vi y k,i 


(43) 


th. 

be the condition for the generation of a carry in the k,i stage. This signal 

is generated by the k, i^ DM when the complement output of the k, i^ 1 
accumulator register becomes a one. Figure 23 shows that g . is generated 

280 ns after the end of the accumulate pulse. The carry produced by the k 
group is then 


k g k,5 + t k, 5* 

g k,4 + t k,5 

+ V 5 ’ V 4 ’ 

V 3 *V 2 

+ t 

• t 

t • t 

k, 5 

k, 4 

k, 3 k, 2 

+ t. 

* t 

t * t 

k, 5 

k, 4 

k, 3 k, 2 


g 


k, 3 


g 


k, 1 


Vi’ 


(44) 
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Equation (44) is satisfactory for carry levels, but with carry pulses 
100 ns in width, Figure 25 shows that for t 4 • t 2 • t 3 • t 4 • t 5 true, each c^ 

pulse into a group will result in two r fc pulses out of that group. For pulse 

carries, equation (44) must then be changed to 


r = (t 

• t * t. 

• t „ 

• t, ) 

k v Tc,i 

k, 2 k, 3 

k, 4 

k, 5 

+ ‘k, 5 * 

g k, 4 ‘ t k, 5 " 

Sc, 4’ 

g k,3 

+ *k, 5 ' 

t * t 

k, 4 k, 3 

g k,2 


+ Sc, 5 * 

t * t 

k, 4 k, 3 

Sc, 2 

«k.i> 

4* (t 

• t ■ t 

• t 

6 t ) 

k,l 

k, 2 k, 3 

k, 4 

k, 5 

= (t 

• t * t. 

• t 

• t ) 

k, 1 

k, 2 k, 3 

k, 4 

k, 5 

+ (t 

• t * t 

• t 

• t, ) 

k,i 

k, 2 k,3 

k, 4 

k, 5 


+ Sc,5’ Sc, 4 g k, 3 + Sc,5 Sc, 4 Sc, 3 Sc, 2 


Sc, 5 * Sc, 4 ‘ Sc, 3 ’ t k, 2 ' 


+ Sc,l Sc, 2 Sc, 3 Sc, 4 Sc, 5^ °k 


(45) 


The carries that must be produced by the stages within the group are described 
by the following equations: 


r k,i = V *M + g k.i 


(46) 
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Figure 25. Carry propagation through group with 



r k, 2 r k, 1 ' *k, 2 + g k, 2 

“V ‘ k ,i- *k,2 +g k,r v 2 +g k, 2 

r k,3 = r k,2' ‘k, 3 + g k, 3 

C k Sc, 1 Sc, 2 Sc, 3 +g k, 1 Sc, 2 ’ V, 3 

+ g k, 2 * *k,3 + s k,3 

r* = T* • f 4* P* 

k, 4 k, 3 k,4 5 k, 4 

= C k ’ t k, 1 ' t k, 2 * t k > 3 ‘ t k, 4 + S k, 1 t k > 2 

Vs” V4 

+ g k, 2 * t k, 3 ’ t k, 4 + S k, 3 * t k, 4 + S k, 4 
r k, 5 ^k 


( 47 ) 


(48) 


(49) 

(50) 


A reduced block diagram of the five-stage group using NAND gates 

to produce the combinational functions in equations (46) through (50) is given 

in Figure 26. To show that this circuit does satisfy the required conditions , 

DeMorgan's theorem is applied to equations written from the diagram. The 

carry -generation enable level, e , is a necessary condition for g, . as 

K, X 

explained in the section on gated carry, but will be omitted from the equations 
as it does not directly affect the argument. 


r k, 1 C k ’ t k, i ‘ g k, i 


C k * t k, 1 + g k, i 


44 




45 


Figure 26. Carry skip group k. 










r k, 2 r k, i Sc, 2 g k, 2 

= r k ,r V2 +8 k i2 

r k, 3 = r k, 2 ’ Sc, 3 ’ S k, 3 
r k, 2 Sc, 3 + g k, 3 

r k,4 _ r k,3 ‘ Sc, 4 g k,4 

= r • t +s* 
k,3 k,4 s k, 4 

= C k ‘ t k, 1 t k ! 2 * Sc, 3 ' Sc, 4 + g k, 1 * Sc, 2 
’ t k, 3 ’ Sc, 4 + g k, 2 ’ Sc, 3 ‘ Sc, 4 + g k, 3 
* Sc, 4 + g k,4 

It is seen that these equations are in agreement with equations (46) through 
(49). 

The carry produced by the fifth stage of the group is 


r 


k, 5 


r k g k, 5 ' Sc, 1 Sc, 2 * Sc, 3 * Sc, 4 * Sc, 5 


r ‘t *(t ’ t *t • t • t ) 

k, 4 k,5 k,l k,2 k, 3 k,4 k, 5 


’ C k * ( Sc, i * Sc, 2 * Sc, 3 * Sc, 4 * Sc, 
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g k, 5 * (t k, l" Sc, 2° Sc, 3 ° Sc, 4* Sc, 5* 

+ r k, 4 ' Sc, 5 (t k, 1 ‘ Sc, 2 ’ t k, 3 * Sc, 4 ’ t k, 5^ 
+ °k ^Sc s 1 Sc, 2 Sc, 3 Sc, 4 Sc, 5^ 


It must be shown that 


Sc, 4 ‘ Sc, 5 * (t k, 1 ’ t k, 2 * Sc, 3 * Sc, 4 * Sc, 5* 


(t k, 1 * Sc, 2 * Sc, 3 * Sc, 4* Sc, 5^ ’ (g k, 4 ’ t 5 


S k, 3 * Sc, 4 * Sc, 5 + S k, 2 ’ Sc, 3 ’ Sc, 4 * Sc, 5 
S k, 1 * Sc, 2 " Sc, 3 * Sc, 4 Sc, 5* 


We see that 


Sc, 5 ' r k, 4 C ' (t k, 1 * t k, 2 ‘ t k, 3 * r k, 4 * t k, 5 } 
+ g k, 4 ’ Sc, 5 + g k, 3 * t k, 4 * Sc, 5 


+ g k, 2 Sc, 3 Sc, 4 Sc, 5 
+ g k, 1 * Sc, 2 * Sc, 3 ’ Sc, 4 * Sc, 5 
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Then 



+ (t, 


k, 1 


S, 


k, 3 


Sc, 2 * Sc, 3’ Sc, 4 * Sc,5^ * (g k,4’ Sc, 5 
Sc, 4 Sc, 5 + g k,2 Sc, 3 Sc, 4 Sc, 5 


+ g k, 1 ’ Sc, 2 Sc, 3 ” t k, 4 ’ Sc, 5^ 


9 


which is the required result. 

It is necessary that the proper gating levels be at gates 2, 3, and 13 
of Figure 26 before a carry pulse reaches them. Figure 27 shows that the 
blocking level will be present 50 ns before g reaches gate 2 and 90 ns before 

Kj U 

r^ ^ reaches gate 3. The gating level will be present 80 ns before c^ reaches 
gate 13. 


The 30-stage accumulator is made up of six groups of five stages 
with c^ = r^ ^ , except that c^ = r g , the end-around carry. This is shown 

in Figure 28. 

The accumulator using carry skips yields a speed advantage of 6. 50 
6. 50/ 1. 24 jus or 5. 21/ 1 over the accumulator previously described. 

The components that must be added to the gated-carry circuit to 
facilitate carry skips are one 6-input NAND gate, one 2- input NAND gate, and 
one inverter for each group. In every fifth stage, two 2-input NAND gates must 
be replaced with 3-input NAND gates. The cost figure for the 30-stage accumu- 
lator with carry skips is then 816. This is an increase of 29. 5 percent over the 
cost of the simple iterative accumulator. 
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Figure 27. Gating level timing diagram. 
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Figure 28. Six-group representation. 







DISCUSSION AND CONCLUSIONS 


The relative cost figures and accumulation times for the simple 
iterative accumulator and accumulators using fast carry techniques are 
presented in Table 4, It is seen that all three realizations investigated offer 
a considerable decrease in accumulation time without too great a difference in 
cost figure. 


TABLE 4, ACCUMULATOR COST AND TIME REQUIREMENTS 


Accumulator Design 

Cost Figure 

Max Ac cum 
Time (/us) 

Avg Accum 
Time (/us) 

Simple iterative accumulator 

630 

6. 50 

6. 50 ! 

Accumulator with gated carry 

756 

2. 14 

2. 14 ; 

Accumulator with carry- 




completion detection 

818 

2. 26 

0. 88 

Accumulator with carry skips 

816 

1. 24 

1. 24 


The comparison may be formalized by the use of a criterion similar 
to that used by Burla [5] for parallel adders. The criterion is 


Q = t/t„ • F/F 0 • 100 


(51) 


where 


t = accumulation time of the circuit under consideration 
t 0 = accumulation time of the iterative accumulator 
F = cost figure of the network under consideration 
F 0 = cost figure of the iterative accumulator. 
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The results of applying this criterion are shown in Table 5. Using 
this criterion, the circuit for carry-completion detection appears best. How- 
ever , the control circuitry for it is more complicated than for the others con- 
sidered. The average accumulation time was calculated using carry propagation 
lengths resulting from the accumulation of numbers with random bit distribution. 
If this distribution is not random , the accumulation time for the carry-com- 
pletion detection circuit could be greater than that for the carry-skip circuit. 


TABLE 5. COMPARISON OF ACCUMULATOR DESIGNS 


Accumulator Design 

t 

(ms) 

t/t 0 x 100 

F 

F/F 0 

Q 

Simple iterative accumulator 

6. 50 

100 

630 

1 

100 

Accumulator with gated carry 

2. 14 

33 

756 

1. 2 

40 

Accumulator with carry- 

(Avg) 

14 

818 

1.3 

18 

completion detection 

0. 880 





Accumulator with carry skips 

1. 24 ' 

19 

816 

1. 3 

25 


In some applications the gated-carry circuit might be desirable. If 
the accumulation time for it is acceptable , it has the advantage of having fewer 
components. 

The gated-carry circuit offers the fastest accumulation time for 
fixed-time operation. In general, the carry-completion detection circuit would 
be recommended for application when a variable accumulation rate is acceptable; 
the carry-skip circuit would be recommended when a fixed accumulation rate 
is desired. 
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DETERMINATION OF OPTIMAL AND ECONOMICAL 
SKIP DISTRIBUTIONS [12] 


A discussion of Majerski's methods for determining optimal and 
economical skip distributions in an adder with a maximum possible number of 
stages for a given maximum allowable propagation time will require the intro- 
duction of several definitions and notational conventions. One-layer skip dis- 
tribution occurs when adder stages are included in carry skips , and every stage 
is included at most in one skip. 

Two-layer skip distribution occurs when adder stages are included in 
two adder skips, and every stage is included at most in two skips. If two skips 
include common stages, all stages contained in the shorter skip are included in 
the longer skip. The longer skip is called first-layer skip, and the shorter one 
is called second-layer skip. If none of the stages in a skip are contained in 
another skip, the skip is treated as a first-layer skip. 

An adder with one-layer skip distribution is divided into groups, each 
of which includes all stages in a skip circuit or a single stage not contained in a 
skip. 


An adder with two-layer skip distribution is divided into sections, 
which include all stages composing one first-layer skip or a single stage not 
included in skips. A section is divided into groups. A group includes all stages 
in a second-layer skip or a single stage in a second-layer skip. Sections and 
groups are numbered successively, starting with the least significant bits. 

The notation used is: 

n = Number of adder stages 

m = Number of groups in an adder with one-layer skip 
distribution 

th 

k., (i = i* 2,. . ,m) = Number of stages of the i group of 

an adder with one -layer skip distribution 



p = Number of skips in an adder with one-layer skip 
distribution 


M = Number of sections in an adder with two-layer skip 
distribution 

th 

K., (i = 1, 2, . . . , M) = Number of stages in the i section of 

an adder with two-layer skip distribution 

P = Number of first-layer skips in an adder with two-layer 
skip distribution 

th 

m., (i = i, 2,. . . ,M) = Number of groups in the i section 

of an adder 



(i = i,2,. . . 


M); j = 1,2,.. 


m.) = Number of stages in the 

\th .th 

j group of the l 

section of an adder 


p., (i = 1,2,. . . ,M) = Number of second-layer skips in the i 

section of an adder 


.th 


d = Maximum propagation time of a NOR gate 


T = Maximum carry propagation time (mcpt); the greatest 
carry propagation time for all possible combinations of 
adder input variables. 


The optimal skip distribution in an n-stage adder is defined as 
occurring when, for a given n, T is minimal and the number of skips is mini- 
mal. The economical skip distribution is defined as occurring when, for a 
given n and T, the number of skips .is minimal. 

The mean carry propagation time (mcpt) in the NOR gate adder with 
one-layer skip distribution is given as: 


T = maximum (T!,T 2 ), where 

Tj = maximum [ (k - 2) + (/3 - a - 1) + k ] (A-i) 

«,/3 “ P 
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(A-2) 


To = (m - 1) + maximum (k - 1) 

& Q, 

a 


and where the a , /3 indexes of groups satisfy the conditions 


i — Q! — m, cu < (3 < m + at and k . + k. (A-3) 

m+i i 

The terms (k^ - 2), k., and maximum (k - 1) denote maximum times of 
up a u 

carry propagation through positions of the groups in which the carry propagation 
begins and ends. The -2 in the term (k^ - 2) is due to the carry propagation 

time beginning with the change of the carry control signal rather than at the 
change of states of the adder inputs. 

For the optimal skip distribution in an n(T) -position adder, the 
maximum number of adder positions is the function of mcpt, T^ 5, given by 
the formulas 


v 


T 



2 


n 


(T) 



(T + 7) 2 ' 
32 


(A-4) 


(A-5) 


where 


is defined by equation (A-4). 


For odd values of T for which T/8 does not produce a remainder 
equal 3, the optimal skip distribution is the one in which the numbers of stages 
in groups are k and 1 alternately. For this case , the following formulas 
apply: 



(A-6) 



T + 3 
4 


(A-7) 
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m = T - 2k + 5 (m . . . even number) 


(A-8) 


For a 36-stage adder, the values given by these formulas are a minimal value 
of T = 17 and a skip distribution of stages as 5, 1, 5, 1, 5, 1, 5, 1, 5, 1, 

5, 1. 


For the values of T such that T/8 gives a remainder of 3, the 
optimal skip distribution is the one with the numbers of stages in successive 
fours of groups equaling K-2, 1, k, 1, In this case, the following formulas 
are used: 


k = 2E 


t 

2 


[ K "+ 1 + 
\ T 



- 1 


(A-9) 



T + 7 
4 


(A-10) 


m = T - 2K + 7 (m . . . number divisible by 4) . (A-ll) 


To determine an economical skip distribution, the distributions for 
two adders are computed and the one with the smaller number of skips is 
chosen. 


The first adder, with k and 1 stages in groups alternately, is 
determined by formulas (A-4), (A-6) , (A-7), and (A-8), with n replacing 
(T) 

n in (A-6). An adder with (m/2) (k + 1) stages is obtained. If (m/2) 
(k + 1) is greater than n , (m/2) (k + 1) - n stages are canceled. This 
should be done in pairs in groups and preferably in such a way as to cancel 
one skip. 


For the adder with k-2, 1, k, 1 positions in successive fours of 
groups, formulas (A-4) , (A-5) , (A-9), (A-10), and (A-ll), with n replacing 
(T) 

n in (A-10) , apply. The number of stages obtained is 


m* (k - 1) + m" (k + 1) 
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where 


m* 



m" = E (m/4) 


(A-12) 


The adder with the smaller number of skips is then the one with economical 
skip distribution. 

The mcpt for an adder with two-layer skip distribution is given by 


T = maximum (T^ T 2 , T 3 , T 4 ) , 


where 


Tj = maximum 



2 ) 



+ (lj> - $ ~ i) 


+ 


[O - 1) +k 



(A- 13) 


1 < 0 < M, $ < ip <M + <p, 1 ^ a — m^, 1 ^ /3^m^, and j 


k.. 

i] 


(A-14) 


T 2 = maximum 

<p>,CL,p 


[<V - 2) + V 


(A-15) 


1 < 0 <M, 1 < O' < /3 < m^ (A-16) 

T 3 = (M - 1) + maximum j(m , - i) + (k , a - l)j (A-17) 

(^,0! 9 
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(A-18) 


1 < 0 < M, 1 — Q! — m , 

T 4 = (M - i) + maximum! (o; - i) + k , 1 

l L *“ J 

+ [^<#>(3 - 2) + % - ">] 

!<</>< m, 1 < a < /3 < m^ 


(A-19) 

(A-20) 


The methods presented here apply exclusively to the NOR gate adders 
with an even number of stages, in which T always takes odd values. 

In an adder composed of an even number of M sections, having K and 
1 stages alternately, with identical skip distributions in multistage sections, 

T may be expressed as: 


T = t + M - 3 (t an even number) 


(A-21) 


where t is a function of the number of stages and of the skip distribution in a 
section. Formula (A-21) defines t . 

Optimal skip distribution in a section occurs when, for a given t , the 

number positions in a section is a maximum and, with maximum K ^ 

the number of skips in the section is a minimum. The parameters for such a 
distribution are given by the following formulas: 


m 


(t) 


m 


(t) 



if t 2= 16 
if t < 16 


(A-22) 


(A-23) 
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X 


k. 

J 


(t) 


l[ i+( - i)J 

minimum [2E - j , 2 e (* ~ ^ — — ) + J ] 


for j = 1,2,. ... ,m 


(t) 


K (t) = 2E 


( 2t - 3m ^ + 2t + 6m ^ 
16 


+ 1 


(A-24) 

(A-25) 


The parameters m^,k. take exclusively odd values. 

3 


(T) 


Majerski found for T = 13, t = 8, t. = 8, 1, 8, 1, 8, 1, 8, 1, n = 32, 

1 (T) 

and for T = 15, t = 10, t. = 12, 1, 8, 1, 12, 1, 8, 1, n = 34. 

(T) 

The optimal skip distribution in an n - position adder with the max- 
imum number of positions for a given T s: 9 may be determined as follows: 


2T - 6 < < 2T + 9 

3 ~ _ 3 


(A-26) 


M = T - t + 3 


(A- 27) 


For t = 6 and t = 8 , 12, 16, ... , the adder is made up of sections 
of and 1 stages alternately. The number of adder stages is 





(A-28) 


For t=10, 14, 18, successive adder sections are taken according 

to the parameters given by the terms of sequences 
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t+2, 1, t-2, 1, t+2, 1, t-2, 1, . . . , t+2, 1, t-2, 1 


(A-29) 


if M is a number divisible by 4 and 

t+2, i, t-2, 1, t+2, 1, t-2, 1, . . . , t+2, 1, t-2, t, i (A-30) 

if M is a number not divisible by 4. 


The 1 terms in formulas (A-29) and (A-30) correspond to one-stage 
sections. A cyclic change of adder stages is permitted. The number of adder 
stages is then 

From among the skip distributions for all t obtained from formula 

(T) (T) 

(A- 26), the one with maximum n stages and for n with minimum 
number of skips is chosen. 

For an economical skip distribution for an n-stage adder with t ^ 9, 
the skip distribution is determined for two adders with 


t = 2E (T/3) , 2E (T/3) + 2, 2E (T/3) +4, ... 

In the first adder, accept and 1-stage sections alternately. The 

number of adder sections and the number of adder stages are given in formulas 
(A-27) and (A-28). 

In the second adder (only if ts 8), accept the same number of adder 
sections, but for successive sections use instead of t the successive terms 
of the sequences 


t+2, 1, t-2, 1, . . . , t+2, 1, t-2, 1 
if M is a number divisible by 4, 


(A-32) 
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or 


t+2, 1, t-2, 1, t+2, 1, t-2, 1, . . . , t+2, i, t-2, 1, t, 1 (A-33) 

if M is a number not divisible by 4 and t = 10, 14, 18, . . . 
or 

2, 1, t+2, 1, t-2, 1, t+2, 1, . . . , t-2, 1, t+2, 1, t, 1 
if M is a number not divisible by 4 and t = 8, 12, 16 . . . 


Such skip distributions are determined for at least three successive 
integers t and for further t until both numbers n' and n" are smaller than 
n . Then for all skip distributions in the adders for which n* > n or n" > n, 
cancel n' -n or n" -n stages to cancel the greatest number of skips. From 
among the obtained skip distributions of n -position adders, the one with 
the minimum number of skips is chosen. 
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